AITopics | Crittenden County

Collaborating Authors

Crittenden County

A High-Quality Text-Rich Image Instruction Tuning Dataset via Hybrid Instruction Generation

Zhou, Shijie, Zhang, Ruiyi, Zhou, Yufan, Chen, Changyou

arXiv.org Artificial IntelligenceDec-20-2024

Large multimodal models still struggle with text-rich images because of inadequate training data. Self-Instruct provides an annotation-free way for generating instruction data, but its quality is poor, as multimodal alignment remains a hurdle even for the largest models. In this work, we propose LLaVAR-2, to enhance multimodal alignment for text-rich images through hybrid instruction generation between human annotators and large language models. Specifically, it involves detailed image captions from human annotators, followed by the use of these annotations in tailored text prompts for GPT-4o to curate a dataset. It also implements several mechanisms to filter out low-quality data, and the resulting dataset comprises 424k high-quality pairs of instructions. Empirical results show that models fine-tuned on this dataset exhibit impressive enhancements over those trained with self-instruct data.

caption, large language model, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2412.16364

Country:

North America > United States > Illinois > Cook County > Chicago (0.05)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
North America > United States > Arkansas > Crittenden County > West Memphis (0.04)
(2 more...)

Genre: Research Report (1.00)

Industry:

Leisure & Entertainment (1.00)
Media > Film (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Add feedback

The Drone Center's Weekly Roundup: 2/27/17

RobohubFeb-27-2017, 17:01:52 GMT

Dronescapes is a collection of vibrant, mystical paintings of drones by Australian artist Kathryn Brimblecombe-Fox. In a conversation with the Center for the Study of the Drone, the artist shares the meaning of her work, explains her use of traditional Australian motifs, and shares her views on the rise of autonomous technology. The Federal Aviation Administration released a new set of reports of airspace incidents involving drones, including close encounters with manned aircraft and drone use over airports. The dataset includes 1,274 reported incidents that occurred between February and September 2016, around 400 more than occurred during the same period in 2015. At the National Interest, Elsa Kania argues that China could soon overtake the U.S. in the development of autonomous drones.

artificial intelligence, drone, unmanned ground vehicle, (14 more...)

Robohub

Country:

Asia > China (0.26)
Asia > Russia (0.15)
North America > United States > New York > New York County > New York City (0.05)
(16 more...)

Industry:

Transportation > Air (1.00)
Information Technology > Robotics & Automation (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
(4 more...)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (1.00)

Add feedback